Python for Data Analysis
Table of Contents
Import Conventions
The Python community has adopted a number of naming conventions for commonly- used modules:
import numpy as np import pandas as pd import matplotlib.pyplot as plt
pandas
The pandas name itself is derived from panel data, an econometrics term for multidi- mensional structured data sets, and Python data analysis itself.
CHAPTER 2 Introductory Examples
1.usa.gov data from bit.ly
downloadable text files: http://www.usa.gov/About/developer-resources/1usagov.shtml
MovieLens 1M Data Set
GroupLens Research (http://www.grouplens.org/node/73) provides a number of collec- tions of movie ratings data collected from users of MovieLens in the late 1990s and early 2000s.
US Baby Names 1880-2010
The United States Social Security Administration (SSA) has made available data on the frequency of baby names from 1880 through the present.
As of this writing, the US Social Security Administration makes available data files, one per year, containing the total number of births for each sex/name combination. The raw archive of these files can be obtained here: http://www.ssa.gov/oact/babynames/limits.html